Grammar fragment acquisition using syntactic and semantic clustering
Abstract
A new method for automatically acquiring grammar fragments for understanding fluently spoken language is proposed. The goal of this method is to generate a collection of grammar fragments, each representing a set of syntactically and semantically similar phrases. First, phrases observed frequently in the training set are selected as candidates. Each candidate phrase has three associated probability distributions: of succeeding contexts, of preceding contexts, and of associated machine actions. The similarity between candidate phrases is measured by applying the Kullback-Leibler distance to these three probability distributions. Candidate phrases that are close in all three distances are clustered into a grammar fragment. This approach detected 246 phrases in the test set that were not present in the training set. Experimental results show that a 3% improvement in call-type classification performance has been achieved by introducing these fragments.
Key words: spoken understanding, preceding and succeeding contexts, Kullback-Leibler distance, phrase similarity, phrase clustering
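The clustering step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify smoothing, symmetrization, a threshold, or a clustering procedure, so the additive smoothing, the symmetrized Kullback-Leibler distance, the `threshold` value, the greedy grouping, and the toy phrase counts below are all assumptions made for illustration.

```python
import math
from collections import Counter


def smoothed(counts, vocab, alpha=0.5):
    """Additive smoothing (an assumption) so every event has nonzero probability."""
    total = sum(counts.values()) + alpha * len(vocab)
    return {v: (counts.get(v, 0) + alpha) / total for v in vocab}


def kl(p, q):
    """Kullback-Leibler divergence D(p || q) over a shared support."""
    return sum(p[v] * math.log(p[v] / q[v]) for v in p)


def sym_kl(c1, c2):
    """Symmetrized KL distance between two count tables (symmetrization is an assumption)."""
    vocab = set(c1) | set(c2)
    p, q = smoothed(c1, vocab), smoothed(c2, vocab)
    return kl(p, q) + kl(q, p)


def close(a, b, threshold):
    """Two phrases are close only if all three distances fall below the threshold."""
    return all(sym_kl(a[k], b[k]) < threshold for k in ("prev", "next", "action"))


def cluster(phrases, threshold=0.5):
    """Greedy grouping (hypothetical): a phrase joins the first fragment
    whose members are all close to it, otherwise it starts a new fragment."""
    fragments = []
    for name, stats in phrases.items():
        for frag in fragments:
            if all(close(stats, phrases[m], threshold) for m in frag):
                frag.append(name)
                break
        else:
            fragments.append([name])
    return fragments


# Toy counts, invented for illustration: preceding words, succeeding words,
# and associated machine actions (call types) for each candidate phrase.
phrases = {
    "calling card": {
        "prev": Counter({"my": 8, "a": 2}),
        "next": Counter({"number": 7, "please": 3}),
        "action": Counter({"CALLING_CARD": 9, "OTHER": 1}),
    },
    "phone card": {
        "prev": Counter({"my": 7, "a": 3}),
        "next": Counter({"number": 6, "please": 4}),
        "action": Counter({"CALLING_CARD": 8, "OTHER": 2}),
    },
    "collect call": {
        "prev": Counter({"a": 9, "my": 1}),
        "next": Counter({"to": 8, "please": 2}),
        "action": Counter({"COLLECT": 10}),
    },
}

fragments = cluster(phrases)
```

With these toy counts, the two card phrases share similar contexts and actions and end up in one fragment, while "collect call" differs in all three distributions and stays separate. Requiring all three distances to be small is what makes a fragment both syntactically and semantically coherent.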
Similar resources
Feature Engineering in Persian Dependency Parser
A dependency parser is one of the most important fundamental tools in natural language processing; it extracts the structure of sentences and determines the relations between words based on dependency grammar. Dependency parsing is well suited to free-word-order languages such as Persian. In this paper, a data-driven dependency parser has been developed with the help of phrase-structure parser fo...
Acquisition of English Prenominal and Postnominal Genitives
This study examined the acquisition of prenominal and postnominal genitives by Iranian EFL learners. Two variables were considered: possessive categories and language proficiency. We considered the influence of possessive categories such as lexical modifier, semantic relationship, and weight and syntactic complexity on genitive alternations by Iranian EFL learners. Also, we examined whether the...
Emergent Functional Grammar for Space
This chapter explores a semantics-oriented approach to the origins of syntactic structure. It reports on preliminary experiments whereby speakers introduce hierarchical constructions and grammatical markers to express which conceptualization strategy hearers are supposed to invoke. This grammatical information helps hearers to avoid semantic ambiguity or errors in interpretation. A simulation s...
Semiautomatic Acquisition of Semantic Structures for Understanding Domain-Specific Natural Language Queries
This paper describes a methodology for semiautomatic grammar induction from unannotated corpora of information-seeking queries in a restricted domain. The grammar contains both semantic and syntactic structures, which are conducive to (spoken) natural language understanding. Our work aims to ameliorate the reliance of grammar development on expert handcrafting or on the availability of annotat...
Improving Verb Clustering with Automatically Acquired Selectional Preferences
In previous research in automatic verb classification, syntactic features have proved the most useful features, although manual classifications rely heavily on semantic features. We show, in contrast with previous work, that considerable additional improvement can be obtained by using semantic features in automatic classification: verb selectional preferences acquired from corpus data using a f...
Journal title:
- Speech Communication
Volume 27
Publication date: 1998